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1 GAATATGATG ACCCTAATGC 

51 GTCATTTGGA AGAACTGCAG 

101 ATGGAGAACA TGTATGCTAT 

151 AAATATATTG GTTTCACCTG 

201 AGAAGAAGAA AGCGTTGCAG 

251 TGGATGAAGA ATTTGTGGAA 

301 GATCTCAACG TTTTAAAGGC 

351 TGCCAAACAA GAAGATATTT 

401 GCCTAGAAGT GGAACGTGTA 

451 GACAATAAGG ATTGGAGAAT 

501 TGGAATTGAA TCTGCTCTAA 

551 ATAATGAAAT TACTAGGACT 

601 ATCAACAATC AGCCGGGAGC 

651 GTTAGGCTCA CTGTCTAGGC 

701 ATGGAAGCTC TTCCTGGACA 

751 AGAGAAATGA GCTTGGCTTG 

801 CCCTTCCCTG GGCTGGTGAA 

851 TTCCCTGCCA AAATGGTGTC 



AACAATATCT AACATACTAT CCGAGCTTCG 
ATTTTCCTCC TTCAAAATTA AAGTCAGGTT 
GTTCTTGATT GCTTCGCTGA AGAAGCATTG 
GAAAAGGCCA ATATACCCAG TAGAAGAATT 
AAGATGATGC AGAATTAACA TTAAATAAAG 
GAAGAGACAG ATAATGAAGA AAACTTTATT 
CCAGACATAT CACTTGGATA TGAACGAGAC 
TGGAATCCAC AACAGATGCT GCAGAATGGA 
CTACCGCAAC TGAAAGTCAC GATTAGGACT 
CCATGTTGAC CAAATGCACC AGCACAGAAG 
AGGAGACCAA. GGGATTTTTG GACAAACTCC 
TTGGAAAAGA TCAGCAGCCG AGAAAAGTAC 
CCATGGAGCA CTGTCCTCAG AGATGCGCAG 
CAGGCCCACC TTAGTCACTG TGGACTGGCA 
CACCTGCCCT AGCCCTCACC CTGGGGTGGA 
CAACTCAGAC CATTCCACGG AGGCATCCTC 
TAAAAGTTTC CTGAGGTCAA GGACTTCCTT 
CAGAACTTTG AGGCCAGAGG TGATCCAGTG 
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901 ATTTGGGAGC TGCAGGTCAC ACAGGCTGCT CAGAGGGCTG CTGAACAGGA 

951 TGTCCTCGGA CGACAGGCAC CTGGGCTCCA GCTGCGGCTC CTTCATCAAG 

1001 ACTGAGCCGT CCAGCCCGTC CTCGGGCATA GATGCCCTCA GCCACCACAG 

1051 CCCCAGTGGC TCGTCCGACG CCAGCGGCGG CTTTGGCCTG GCCCTGGGCA 

1101 CCCACGCCAA CGGTCTGGAC TCGCCACCCA TGTTTGCAGG CGCCGGGCTG 

g 1151 GGAGGCACCC CATGCCGCAA GAGCTACGAG GACTGTGCCA GCGGCATCAT 

5 1201 GGAGGACTCG GCCATCAAGT GCGAGTACAT GCTCAACGCC ATCCCCAAGC 

5 1251 GCCTGTGCCT CGTGTGCGGG GACATTGCCT CTGGCTACCA CTACGGCGTG 

s - 

I 1301 GCCTCCTGCG AGGCTTGCAA GGCCTTCTTC AAGAGGACTA TCCAAGGGAA 

M| 1351 CATTGAGTAC AGCTGCCCGG CCACCAACGA GTGCGAGATC ACCAAACGGA 

flj 

W 1401 GGCGCAAGTC CTGCCAGGCC TGCCGCTTCA TGAAATGCCT CAAAGTGGGG 

iy 1451 ATGCTGAAGG AAGGTGTGCG CCTTGATCGA GTGCGTGGAG GCCGTCAGAA 

1501 ATACAAGCGA CGGCTGGACT CAGAGAGCAG CCCATACCTG AGCTTACAAA 

1551 TTTCTCCACC TGCTAAAAAG CCATTGACCA AGATTGTCTC ATACCTACTG 

1601 GTGGCTGAGC CGGACAAGCT CTATGCCATG CCTCCCCCTG GTATGCCTGA 

1651 GGGGGACATC AAGGCCCTGA CCACTCTCTG TGACCTGGCA GACCGAGAGC 

1701 TTGTGGTCAT CATTGGCTGG GCCAAGCACA TCCCAGGCTT CTCAAGCCTC 

1751 TCCCTGGGGG ACCAGATGAG CCTGCTGCAG AGTGCCTGGA TGGAAATCCT 
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1801 


CATCCTGGGC 


ATCGTGTACC 


GCTCGCTGCC 


CTACGACGAC 


AAGCTGGTGT 


1851 


ACGCTGAGGA 


CTACATCATG 


GATGAGGAGC 


ACTCCCGCCT 


CGCGGGGCTG 


1901 


CTGGAGCTCT 


ACCGGGCCAT 


CCTGCAGCTG 


GTACGCAGGT 


ACAAGAAGCT 


1951 


CAAGGTGGAG 


AAGGAGGAGT 


TTGTGACGCT 


CAAGGCCCTG 


GCCCTCGCCA 


2001 


ACTCCGATTC 


CATGTACATC 


GAGGATCTAG 


AGGCTGTCCA 


GAAGCTGCAG 


2051 


GACCTGCTGC 


ACGAGGCACT 


GCAGGACTAC 


GAGCTGAGCC 


AGCGCCATGA 


2101 


GGAGCCCTGG 


AGGACGGGCA 


AGCTGCTGCT 


GACACTGCCG 


CTGCTGCGGC 


2151 


AGACGGCCGC 


CAAGGCCGTG 


CAGCACTTCT 


ATAGCGTCAA 


ACTGCAGGGC 


2201 


AAAGTGCCCA 


TGCACAAACT 


CTTCCTGGAG 


ATGCTGGAGG 


CCAAGGCCTG 


2251 


GGCCAGGGCT 


GACTCCCTTC 


AGGAGTGGAG 


GCCACTGGAG 


CAAGTGCCCT 


2301 


CTCCCCTCCA 


CCGAGCCACC 


AAGAGGCAGC 


ATGTGCATTT 


CCTAACTCCC 


2351 


TTGCCCCCTC 


CCCCATCTGT 


GGCCTGGGTG 


GGCACTGCTC 


AGGCTGGATA 


2401 


CCACCTGGAG 


GTTTTCCTTC 


CGCAGAGGGC 


AGGTTGGCCA 


AGAGCAGCTT 


2451 


AGAGGATCTC 


CCAAGGATGA 


AAGAATGTCA 


AGCCATGATG 


GAAAATGCCC 


2501 


CTTCCAATCA 


GCTGCCTTCA 


CAAGCAGGGA 


TCAGAGCAAC 


TCCCCGGGGA 


2551 


TCCCCAATCC 


ACGCCCTTCT 


AGTCCAACCC 


CCCTCAATGA 


GAGAGGCAGG 


2601 


CAGATCTCAC 


CCAGCACTAG 


GACACCAGGA 


GGCCAGGGAA 


AGCATCTCTG 


2651 


GCTCACCATG 


TAACATCTGG 


CTTGGAGCAA 


GTGGGTGTTC 


TGCACACCAG 


2701 


GCAGCTGCAC 


CTCACTGGAT 


CTAGTGTTGC 


TGCGAGTGAC 


CTCACTTCAG 


2751 


AGCCCCTCTA GCAGAGTGGG GCGGAAGTCC TGATGGTTGG TGTCCATGAG 


2801 


GTGGAAG (SEQ ID NO:l) 
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GAATATGATGACCCTAATGCAACAATATCTAACATACTATCCGAGCTTCGGTCATTTGGA 

1 + + y + + + 60 

CTTATACTACTGGGATTACGTTGTTATAGATTGTATGATAGGCTCGAAGCCAGTAAACCT 

AGAACTGCAGATTTTCCTCCTTCAAAATTAAAGTCAGGTTATGGAGAACATGTATGCTAT 

61 + + + + + + 120 

TCTTGACGTCTAAAAGGAGGAAGTTTTAATTTCAGTCCAATACCTCTTGTACATACGATA 

GTTCTTGATTGCTTCGCTGAAGAAGCATTGAAATATATTGGTTTCACCTGGAAAAGGCCA 

121 + + + + + + 180 

CAAGAACTAACGAAGCGACTTCTTCGTAACTTTATATAACCAAAGTGGACCTTTTCCGGT 

ATATACCCAGTAGAAGAATTAGAAGAAGAAAGCGTTGCAGAAGATGATGCAGAATTAACA 

|=§1 + + + + + + 240 

O TATATGGGTCATCTTCTTAATCTTCTTCTTTCGCAACGTCTTCTACTACGTCTTAATTGT 

III TTAAATAAAGTGGATGAAGAATTTGTGGAAGAAGAGACAGATAATGAAGAAAACTTTATT 

gi + + + + + + 300 

PI AATTTATTTCACCTACTTCTTAAACACCTTCTTCTCTGTCTATTACTTCTTTTGAAATAA 



GATCTCAACG7TTTAAAGGCCCAGACATATCACTTGGATATGAACGAGACTGCCAAACAA 

301 + + + + -r + 360 

O CTAGAGTTGCAAAATTTCCGGGTCTGTATAGTGAACCTATACTTGCTCTGACGGTTTG7T 

N : 

W GAAGATATTTTGGAATCCACAACAGATGCTGCAGAATGGAGCCTAGAAGTGGAACGTGTA 

|62 + + + + + + 420 

CTTCTATAAAACCTTAGGTGTTGTCTACGACGTCTTACCTCGGATCTTCACCTTGCACAT 

CTACCGCAACTGAAAGTCACGATTAGGACTGACAATAAGGATTGGAGAATCCATGTTGAC 

421 + + + + + + 480 

GATGGCGTTGACTTTCAGTGCTAATCCTGACTGTTATTCCTAACCTCTTAGGTACAACTG 

CAAATGCACCAGCACAGAAGTGGAATTGAATCTGCTCTAAAGGAGACCAAGGGATTTTTG 

4B1 + + + + + + 540 

GTTTACGTGGTCGTGTCTTCACCTTAACTTAGACGAGATTTCCTCTGGTTCCCTAAAAAC 
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GACAAACTCCATAATGAAATTACTAGGACTTTGGAAAAGATCAGCAGCCGAGAAAAGTAC 

+ + + + + + 600 

CTGTTTGAGGTATTACTTTAATGATCCTGAAACCTTTTCTAGTCGTCGGCTCTTTTCATG 

ATCAACAATCAGCCGGGAGCCCATGGAGCACTGTCCTCAGAGATGCGCAGGTTAGGCTCA 

+ + + h + + 660 

TAGTTGTTAGTCGGCCCTCGGGTACCTCGTGACAGGAGTCTCTACGCGTCCAATCCGAGT 

CTGTCTAGGCCAGGCCCACCTTAGTCACTGTGGACTGGCAATGGAAGCTCTTCCTGGACA 

+ + + + + + 720 

GACAGATCCGGTCCGGGTGGAATCAGTGACACCTGACCGTTACCTTCGAGAAGGACCTGT 

CACCTGCCCTAGCCCTCACCCTGGGGTGGAAGAGAAATGAGCTTGGCTTGCAACTCAGAC 

+ + + + + + 780 

GTGGACGGGATCGGGAGTGGGACCCCACCTTCTCTTTACTCGAACCGAACGTTGAGTCTG 

CATTCCACGGAGGCATCCTCCCCTTCCCTGGGCTGGTGAATAAAAGTTTCCTGAGGTCAA 

. + + + + + + 840 

GTAAGGTGCCTCCGTAGGAGGGGAAGGGACCCGACCACTTATTTTCAAAGGACTCCAGTT 

GGACTTCCTTTTCCCTGCCAAAATGGTGTCCAGAACTTTGAGGCCAGAGGTGATCCAGTG 

+ + + + + + 900 

CC TGAAGGAAAAGGGACGGTTTTACCACAGGTCTTGAAACTCC GGTCTCC ACTAGGTCAC 

ATTTGGGAGCTGCAGGTCACACAGGCTGCTCAGAGGGCTGCTGAACAGGATGTCCTCGGA 

+ + + + + + 960 

TAAACCCTCGACGTCCAGTGTGTCCGACGAGTCTCCCGACGACTTGTCCTACAGGAGCCT 

M S S D 



CGACAGGCACCTGGGCTCCAGCTGCGGCTCCTTCATCAAGACTGAGCCGTCCAGCCCGTC 

+ + + + + + 1020 

GCTGTCCGTGGACCCGAGGTCGACGCCGAGGAAGTAGTTCTGACTCGGCAGGTCGGGCAG 
DRHLGSSCGSFIKTEPSSPS 
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CTCGGGCATAGATGCCCTCAGCCACCACAGCCCCAGTGGCTCGTCCGACGCCAGCGGCGG 

1021 — * + + + + + + 1° 80 

GAGCCCGTATCTACGGGAGTCGGTGGTGTCGGGGTCACCGAGCAGGCTGCGGTCGCCGCC 
SGIDALSHHSPSGSSDASGG 

CTTTGGCCTGGCCCTGGGCACCCACGCCAACGGTCTGGACTCGCCACCCATGTTTGCAGG 

1061 + + + + + + H40 

GAAACCGGACCGGGACCCGTGGGTGCGGTTGCCAGACCTGAGCGGTGGGTACAAACGTCC 
FGLALGTHANGLDSPPMFAG 

U, CGCCGGGCTGGGAGGCACCCCATGCCGCAAGAGCTACGAGGACTGTGCCAGCGGCATCAT 

+ + + + + + 1200 

-5 GCGGCCCGACCCTCCGTGGGGTACGGCGTTCTCGATGCTCCTGACACGGTCGCCGTAGTA 
Tn AGLGGTPCRKSYEDCASGIM 

am 

m GGAGGACTCGGCCATCAAGTGCGAGTACATGCTCAACGCCATCCCCAAGCGCCTGTGCCT 

12 || + + + + + + 1260 

Z CCTCCTGAGCCGG7AGTTCACGCTCATGTACGAGTTGCGGTAGGGGTTCGCGGACACGGA 

1~ EDSAIKCEYMLNAIPKRL C L 

U CGTGTGCGGGGACATTGCCTCTGGCTACCACTACGGCGTGGCCTCCTGCGAGGCTTGCAA 

12 iji + + + + + + 1320 

i =1 GCACACGCCCCTGTAACGGAGACCGATGGTGATGCCGCACCGGAGGACGCTCCGAACGTT 

S VCGDIASGYHYGVASCEACK 

Hi 

3 W GGCCTTCTTCAAGAGGACTATCCAAGGGAACATTGAGTACAGCTGCCCGGCCACCAACGA 

132 i + + + + + + 1380 

CCGGAAGAAGTTCTCCTGATAGGT7CCCTTGTAACTCATGTCGACGGGCCGGTGGTTGCT 
AfFRRTIOGNIKYSCPATHl 

GTGCGAGATCACCAAACGGAGGCGCAAGTCCTGCCAGGCCTGCCGCTTCATGAAATGCCT 

13B1 + + + + + + 1440 

CACGCTCTAGTGGTTTGCCTCCGCGTTCAGGACGGTCCGGACGGCGAAGTACTTTACGGA 
C I Z T K RRRKSCQACRrMKCL 
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CAAAGTGGGGATGCTGAAGGAAGGTGTGCGCCTTGATCGAGTGCGTGGAGGCCGTCAGAA 

1441 + + + + + + 1500 

GTTTCACCCCTACGACTTCCTTCCACACGCGGAACTAGCTCACGCACCTCCGGCAGTCTT 
K V G M LKEGVRLDRVRGGRQK 

ATACAAGCGACGGCTGGACTCAGAGAGCAGCCCATACCTGAGCTTACAAATTTCTCCACC 

1501 + + + + + + 1560 

TATGTTCGCTGCCGACCTGAGTCTCTCGTCGGGTATGGACTCGAATGTTTAAAGAGGTGG 
Y K RRLDSESSPYLSLQI S P P 

TGCTAAAAAGCCATTGACCAAGATTGTCTCATACCTACTGGTGGCTGAGCCGGACAAGCT 

1561 + + + + + + 1620 

y, ACGATTTTTCGGTAACTGGTTCTAACAGAGTATGGATGACCACCGACTCGGCCTGTTCGA 
h AKKPLTKIVSYLLVAEPDKL 

m CTATGCCATGCCTCCCCCTGGTATGCCTGAGGGGGACATCAAGGCCCTGACCACTCTCTG 

isji + + + + + + 1680 

jjj GATACGGTACGGAGGGGGACCATACGGACTCCCCCTGTAGTTCCGGGACTGGTGAGAGAC 
> YAMPPPGMPEGDIKALTTLC 

u 

. TGACCTGGCAGACCGAGAGCTTGTGGTCATCATTGGCTGGGCCAAGCACATCCCAGGCTT 

2$g| + + + + + + 1740 

5 ACTGGACCGTCTGGCTCTCGAACACCAGTAGTAACCGACCCGGTTCGTGTAGGGTCCGAA 
f|| DLADRELVVI IGWA 'KHI PGF 

p CTCAAGCCTCTCCCTGGGGGACCAGATGAGCCTGCTGCAGAGTGCCTGGATGGAAATCCT 

17 II + + + + + + 1800 

GAGTTCGGAGAGGGACCCCCTGGTCTACTCGGACGACGTCTCACGGACCTACCTTTAGGA 
SSLSLGDQMSLLQSAWMEIL 

CATCCTGGGCATCGTGTACCGCTCGCTGCCCTACGACGACAAGCTGGTGTACGCTGAGGA 

2801 + + + + + + I860 

GTAGGACCCGTAGCACATGGCGAGCGACGGGATGCTGCTGTTCGACCACATGCGACTCCT 
ILGIVYRSLPYDDKLVYAED 
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CTACATCATGGATGAGGAGCACTCCCGCCTCGCGGGGCTGCTGGAGCTCTACCGGGCCAT 

1861 + + + + + + 

GATGTAGTACCTACTCCTCGTGAGGGCGGAGCGCC CCGACGACCTCGAGATGGCCC GGTA 
YIMDEEHSRLAGLLELYRAI 

CCTGCAGCTGGTACGCAGGTACAAGAAGCTCAAGGTGGAGAAGGAGGAGTTTGTGACGCT 

1921 + + + + + + 1980 

GGACGTCGACCATGCGTCCATGTTCTTCGAGTTCCACCTCTTCCTCCTCAAACACTGCGA 
LQLVRRYKKLKVEKEEFVTL 

CAAGGCCCTGGCCCTCGCCAACTCCGATTCCATGTACATCGAGGATCTAGAGGCTGTCCA 

29B1 + + + + + + 2040 

h& GTTCCGGGACCGGGAGCGGTTGAGGCTAAGGTACATGTAGCTCCTAGATCTCCGACAGGT 
O KALALANSDSMYIEDLEAVQ 

W GAAGCTGCAGGACCTGCTGCACGAGGCACTGCAGGACTACGAGCTGAGCCAGCGCCATGA 

2046. + + + + + + 2100 

Cft CTTCGACGTCCTGGACGACGTGCTCCGTGACGTCCTGATGCTCGACTCGGTCGCGGTACT 

*P KLQDLLHEALQDYELSQRHE 



Z s 



• GGAGCCCTGGAGGACGGGCAAGCTGCTGCTGACACTGCCGCTGCTGCGGCAGACGGCCGC 

2 m + + + + + + 2160 

I* CCTCGGGACCTCCTGCCCGTTCGACGACGACTGTGACGGCGACGACGCCGTCTGCCGGCG 
W EPWRTGKLLLTLPLLRQTAA 



5 ~#r 



O CAAGGCCGTGCAGCACTTCTATAGCGTCAAACTGCAGGGCAAAGTGCCCATGCACAAACT 

2 m + + + + + + 2220 

GTTCCGGCACG7CGTGAAGATATCGCAGTTTGACGTCCCGTTTCACGGGTACGTGTTTGA 
KAVQHFYSVKLQGKVPMHKL 

CTTCCTGGAGATGCTGGAGGCCAAGGCCTGGGCCAGGGCTGACTCCCTTCAGGAGTGGAG 

2221 + + + + + + 2280 

GAAGGACCTCTACGACCTCCGGTTCCGGACCCGGTCCCGACTGAGGGAAGTCCTCACCTC 
FLEMLEAKAWARADSLQEWR 
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GCCACTGGAGCAAGTGCCCTCTCCCCTCCACCGAGCCACCAAGAGGCAGCATGTGCATTT 

2281 + + + + + + 2340 

CGGTGACCTCGTTCACGGGAGAGGGGAGGTGGCTCGGTGGTTCTCCGTCGTACACGTAAA 
P LEQVPSPLHRATKRQHVHF 

CCTAACTCCCTTGCCCCCTCCCCCATCTGTGGCCTGGGTGGGCACTGCTCAGGCTGGATA 

2341 + + + + + + 2400 

GGATTGAGGGAACGGGGGAGGGGGTAGACACCGGACCCACCCGTGACGAGTCCGACCTAT 
LTPLPPPPSVAWVGTAQAGY 

CCACCTGGAGGTTTTCCTTCCGCAGAGGGCAGGTTGGCCAAGAGCAGCTTAGAGGATCTC 

2401 + + + + + + 2460 

hk GGTGGACCTCCAAAAGGAAGGCGTCTCCCGTCCAACCGGTTCTCGTCGAATCTCCTAGAG 

O HLEVFLPQRAGWPRAA* (SEQ ID N0:2) 

U1 CCAAGGATGAAAGAATGTCAAGCCATGATGGAAAATGCCCCTTCCAATCAGCTGCCTTCA 

2443. + + + + + + 2520 

CO GGTTCCTACTTTCTTACAGTTCGGTACTACCTTTTACGGGGAAGGTTAGTCGACGGAAGT 

CAAGCAGGGATCAGAGCAACTCCCCGGGGATCCCCAATCCACGCCCTTCTAGTCCAACCC 

2521 + + + + + + 2580 

Q GTTCGTCCCTAGTCTCGTTGAGGGGCCCCTAGGGGTTAGGTGCGGGAAGATCAGGTTGGG 

PJ CCCTCAATGAGAGAGGCAGGCAGATCTCACCCAGCACTAGGACACCAGGAGGCCAGGGAA 

25H& + + + + + + 2640 

O GGGAGTTACTCTCTCCGTCCGTCTAGAGTGGGTCGTGATCCTGTGGTCCTCCGGTCCCTT 

AGCATCTCTGGCTCACCATGTAACATCTGGCTTGGAGCAAGTGGGTGTTCTGCACACCAG 

2641 + + + + + + 2700 

TCGTAGAGACCGAGTGGTACATTGTAGACCGAACCTCGTTCACCCACAAGACGTGTGGTC 

GCAGCTGCACCTCACTGGATCTAGTGTTGCTGCGAGTGACCTCACTTCAGAGCCCCTCTA 

2701 + + + + + + 2760 

CGTCGACGTGGAGTGACCTAGATCACAACGACGCTCACTGGAGTGAAGTCTCGGGGAGAT 

GCAGAGTGGGGCGGAAGTCCTGATGGTTGGTGTCCATGAGGTGGAAG (SEQ ID NO:l) 

2761 + + + + 2807 

CGTCTCACCCCGCCTTCAGGACTACCAACCACAGGTACTCCACCTTC (SEQ ID N0:29) 
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MSSDDRHLGS SCGSFIKTEP SSPSSGIDAL SHHSPSGSSD ASGGFGLALG 
THANGLDSPP MFAGAGLGGT PCRKSYEDCA SGIMEDSAIK CEYMLNAIPK 
RLCLVCGDIA SGYHYGVASC EACK&TFKRT IQGWIEYSCP ATOECEITKR 
RRKSCQACKF MKCLKVaCLK EGVRLDRVRG GRQKYKRRLD SESSPYLSLQ 
ISPPAKKPLT KIVSYLLVAE PDKLYAMPPP GMPEGDIKAL TTLCDLADRE 
LWIIGWAKH IPGFSSLSLG DQMSLLQSAW MEILILGIVY RSLPYDDKLV 
YAEDYIMDEE HSRLAGLLEL YRAILQLVRR YKKLKVEKEE FVTLKALALA 
NSDSMYIEDL EAVQKLQDLL HEALQDYELS QRHEEPWRTG KLLLTLPLLR 
QTAAKAVQHF YSVKLQGKVP MHKLFLEMLE AKAWARADSL QEWRPLEQVP 
SPLHRATKRQ HVHFLTPLPP PPSVAWVGTA QAGYHLEVFL PQRAGWPRAA 
(SEQ ID N0:2) 
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1 GCGGGCCGCC AGTGTGGTGG AATTCGGCTT GTCACTAGGA GAACATTTGT 
51 GTTAATTGCA CTGTGCTCTG TCAAGGAAAC TTTGATTTAT AGCTGGGGTG 
101 CACAAATAAT GGTTGCCGGT CGCACATGGA TTCGGTAGAA CTTTGCCTTC 
151 CTGAATCTTT TTCCCTGCAC TACGAGGAAG AGCTTCTCTG CAGAATGTCA 
201 AACAAAGATC GACACATTGA TTCCAGCTGT TCGTCCTTCA TCAAGACGGA 
251 ACCTTCCAGC CCAGCCTCCC TGACGGACAG CGTCAACCAC CACAGCCCTG 
301 GTGGCTCTTC AGACGCCAGT GGGAGCTACA GTTCAACCAT GAATGGCCAT 
351 CAGAACGGAC TTGACTCGCC ACCTCTCTAC CCTTCTGCTC CTATCCTGGG 
401 AGGTAGTGGG CCTGTCAGGA AACTGTATGA TGACTGCTCC AGCACCATTG 
451 TTGAAGATCC CCAGACCAAG TGTGAATACA TGCTCAACTC GATGCCCAAG 
501 AGACTGTGTT TAGTGTGTGG TGACATCGCT TCTGGGTACC ACTATGGGGT 
551 AGCATCATGT GAAGCCTGCA AGGCATTCTT CAAGAGGACA ATTCAAGGCA' 
601 ATATAGAATA CAGCTGCCCT GCCACGAATG AATGTGAAAT CACAAAGCGC 
651 AGACGTAAAT CCTGCCAGGC TTGCCGCTTC ATGAAGTGTT TAAAAGTGGG 
701 CATGCTGAAA GAAGGGGTGC GTCTTGACAG AGTACGTGGA GGTCGGCAGA 
751 AGTACAAGCG CAGGATAGAT GCGGAGAACA GCCCATACCT GAACCCTCAG 
801 CTGGTTCAGC CAGCCAAAAA GCCATATAAC AAGATTGTCT CACATTTGTT 
851 GGTGGCTGAA CCGGAGAAGA TCTATGCCAT GCCTGACCCT ACTGTCCCCG 
901 ACAGTGACAT CAAAGCCCTC ACTACACTGT GTGACTTGGC CGACCGAGAG 
951 TTGGTGGTTA TCATTGGATG GGCGAAGCAT ATTCCAGGCT TCTCCACGCT 
1001 GTCCCTGGCG GACCAGATGA GCCTTCTGCA GAGTGCTTGG ATGGAAATTT 
1051 TGATCCTTGG TGTCGTATAC CGGTCTCTTT CATTTGAGGA TGAACTTGTC 
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1101 TATGCAGACG ATTATATAAT GGACGAAGAC CAGTCCAAAT TAGCAGGCCT 

1151 TCTTGATCTA AATAATGCTA TCCTGCAGCT GGTAAAGAAA TACAAGAGCA 

1201 TGAAGCTGGA AAAAGAAGAA TTTGTCACCC TCAAAGCTAT AGCTCTTGCT 

1251 AATTCAGACT CCATGCACAT AGAAGATGTT GAAGCCGTTC AGAAGCTTCA 

1301 GGATGTCTTA CATGAAGCGC TGCAGGATTA TGAAGCTGGC CAGCACATGG 

1351 AAGACCCTCG TCGAGCTGGC AAGATGCTGA TGACACTGCC ACTCCTGAGG 

1401 CAGACCTCTA CCAAGGCCGT GCAGCATTTC TACAACATCA AACTAGAAGG 

1451 CAAAGTCCCA ATGCACAAAC TTTTTTTGGA AATGTTGGAG GCCAAGGTCT 

1501 GACTAAAAGC TCCCTGGGCC TTCCCATCCT TCATGTTGAA AAAGGGAAAA 

1551 TAAACCCAAG AGTGATGTCG AAGAAACTTA GAGTTTAGTT AACAACATCA 

1601 AAAATCAACA GACTGCACTG ATAATTTAGC AGCAAGACTA TGAAGCAGCT 

1651 TTCAGATTCC TCCATAGGTT CCTGATGAGT TCTTTCTACT TTCTCCATCA 

1701 TCTTCTTTCC TCTTTCTTCC CACATTTCTC TTTCTCTTTA TTTTTTCTCC 

1751 TTTTCTTCTT TCACCTCCCT TATTTCTTTG CTTCTTTCAT TCCTAGTTCC 

1801 CATTCTCCTT TATTTTCTTC CCGTCTGCCT GCCTTCTTTC TTTTCTTTAC 

1851 CTACTCTCAT TCCTCTCTTT TCTCATCCTT CCCCTTTTTT CTAAATTTGA 

1901 AATAGCTTTA GTTTAAAAAA AAAAATCCTC CCTTCCCCCT TTCCTTTCCC 

1951 TTTCTTTCCT TTTTCCCTTT CCTTTTCCCT TTCCTTTCCT TTCCTCTTGA 

2001 CCTTCTTTCC ATCTTTCTTT TTCTTCCTTC TGCTGCTGAA CTTTTAAAAG 

2051 AGGTCTCTAA CTGAAGAGAG ATGGAAGCCA GCCCTGCCAA AGGATGGAGA 

2101 TCCATAATAT GGATGCCAGT GAACTTATTG TGAACCATAC CGTCCCCAAT 
2151 GACTAAGGAA TCAAAGAGAG AGAACCAACG TTCCTAAAAG TACAGTGCAA 
2201 CATATACAAA TTGACTGAGT GCAGTATTAG ATTTCATGGG AGCAGCCTCT 
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2251 


AATTAGACAA CTTAAGCAAC GTTGCATCGG CTGCTTCTTA TCATTGCTTT 




2301 


TCCATCTAGA TCAGTTACAG CCATTTGATT CCTTAATTGT TTTTTCAAGT 




2351 


CTTCCAGGTA TTTGTTAGTT TAGCTACTAT GTAACTTTTT CAGGGAATAG 




2401 


TTTAAGCTTT ATTCATTCAT GCAATACTAA AGAGAAATAA GAATACTGCA 




2451 


ATTTTGTGCT GGCTTTGAAC AATTACGAAC AATAATGAAG GACAAATGAA 




2501 


TCCTGAAGGA AGATTTTTAA AAATGTTTTG TTTCTTCTTA CAAATGGAGA 




2551 


TTTTTTTGTA CCAGCTTTAC CACTTTTCAG CCATTTATTA ATATGGGAAT 


w$ 


2601 


TTAACTTACT CAAGCAATAG TTGAAGGGAA GGTGCATATT ATCACGGATG 




2651 


CAATTTATGT TGTGTGCCAG TCTGGTCCCA AACATCAATT TCTTAACATG 


jr: 


2701 


AGCTCCAGTT TACCTAAATG TTCACTGACA CAAAGGATGA GATTACACCT 




2751 


ACAGTGACTC TGAGTAGTCA CATATATAAG CACTGCACAT GAGATATAGA' 




2801 


TCCGTAGAAT TGTCAGGAGT bLAttiLlti A^iioubAVju xvj^v. 


5 If 


2851 


ATATGATTTC TAGCTGCCAT GGTGGTTAGG AATGTGATAC TGCCTGTTTG 


o 
ru 


2901 


CAAAGTCACA GACCTTGCCT CAGAAGGAGC TGTGAGCCAG TATTCATTTA 




2951 


AGAGAATTCC ACCACACTGG CGGCCCGCGC TTGAT (SEQ ID NO: 3) 
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GCGGGCCGCCAGTGTGGTGGAATTCGGCTTGTCACTAGGAGAACATTTGTGTTAATTGCA 

1 + + + 4 + + 60 

CGCCCGGCGGTCACACCACCTTAAGCCGAACAGTGATCCTCTTGTAAACACAATTAACGT 

CTGTGCTCTGTCAAGGAAACTTTGATTTATAGCTGGGGTGCACAAATAATGGTTGCCGGT 

61 + + 4 + + + 120 

GACACGAGACAGTTCCTTTGAAACTAAATATCGACCCCACGTGTTTATTACCAACGGCCA 

CGCACATGGATTCGGTAGAACTTTGCCTTCCTGAATCTTTTTCCCTGCACTACGAGGAAG 

12 i + + + + + + 180 

GCGTGTACCTAAGCCATCTTGAAACGGAAGGACTTAGAAAAAGGGACGTGATGCTCCTTC 
MDSVELCLPESFSLHYEEE 

AGCTTCTCTGCAGAATGTCAAACAAAGATCGACACATTGATTCCAGCTGTTCGTCCTTCA 

181 + + + + +— "+ 240 

H TCGAAGAGACGTCTTACAGTTTGTTTCTAGCTGTGTAACTAAGGTCGACAAGCAGGAAGT 
5 LLCRMSNKDRHIDSSCSSFI 



TCAAGACGGAACCTTCCAGCCCAGCCTCCCTGACGGACAGCGTCAACCACCACAGCCCTG 

Hi + + 4 + + + 300 

-P AGTTCTGCCTTGGAAGGTCGGGTCGGAGGGACTGCCTGTCGCAGTTGGTGGTGTCGGGAC 
>fA KTEPSSPASLTDSVNHHSPG- 

GTGGCTCTTCAGACGCCAGTGGGAGCTACAGTTCAACCATGAATGGCCATCAGAACGGAC 

aoi + + + — + + + 360 

hi CACCGAGAAGTCTGCGGTCACCCTCGATGTCAAGTTGGTACTTACCGGTAGTCTTGCCTG 
p GSSDASGSYSSTMNGHQNGL 

TTGACTCGCCACCTCTCTACCCTTCTGCTCCTATCCTGGGAGGTAGTGGGCCTGTCAGGA 

361 + + + 4 -+ + 420 

AACTGAGCGGTGGAGAGATGGGAAGACGAGGATAGGACCCTCCATCACCCGGACAGTCCT 
DSPPLYPSAPILGGSGPVRK 

AACTGTATGATGACTGCTCCAGCACCATTGTTGAAGATCCCCAGACCAAGTGTGAATACA 

421 + + + + + + 480 

TTGACATACTACTGACGAGGTCGTGGTAACAACTTCTAGGGGTCTGGTTCACACTTATGT 

LYDDCSSTIVEDPQTKCEYM 

TGCTCAACTCGATGCCCAAGAGACTGTGTTTAGTGTGTGGTGACATCGCTTCTGGGTACC 

481 + 4 4 + + + 540 

ACGAGTTGAGCTACGGGTTCTCTGACACAAATCACACACCACTGTAGCGAAGACCCATGG 
LNSMPKRL CLVCGDIASGYH 
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ACTATGGGGTAGCATCATGTGAAGCCTGCAAGGCATTCTTCAAGAGGACAATTCAAGGCA 

541 + + + + + + 600 

TGATACCCCATCGTAGTACACTTCGGACGTTCCGTAAGAAGTTCTCCTGTTAAGTTCCGT 
YGVASCgACKAFFKRTIQGN 

ATATAGAATACAGCTGCCCTGCCACGAATGAATGTGAAATCACAAAGCGCAGACGTAAAT 

601 — • + + + + + + 660 

TATATCTTATGTCGACGGGACGGTGCTTACTTACACTTTAGTGTTTCGCGTCTGCATTTA 
IgYSCFATKECEITKRRRKS 

CCTGCCAGGCTTGCCGCTTCATGAAGTGTTTAAAAGTGGGCATGCTGAAAGAAGGGGTGC 

661 , — + + + + 4 + 720 

GGACGGTCCGAACGGCGAAGTACTTCACAAATTTTCACCCGTACGACTTTCTTCCCCACG 
CQACRTMKCLKVGM L K E G V R 

p GTCTTGACAGAGTACGTGGAGGTCGGCAGAAGTACAAGCGCAGGATAGATGCGGAGAACA 

•jjl + + + + + + 780 

m CAGAACTGTCTCATGCACCTCCAGCCGTCTTCATGTTCGCGTCCTATCTACGCCTCTTGT 

£ LDRVRGGRQKYKRRI DAENS 

m 

*P GCCCATACCTGAACCCTCAGCTGGTTCAGCCAGCCAAAAAGCCATATAACAAGATTGTCT 

f"8l + + — + + + + ' 840 

.L CGGGTATGGACTTGGGAGTCGACCAAGTCGGTCGGTTTTTCGGTATATTGTTCTAACAGA 
H PYLNPQLVQPAKKPYNKIVS 

si s s 

y CACATTTGTTGGTGGCTGAACCGGAGAAGATCTATGCCATGCCTGACCCTACTGTCCCCG 

gl + + + + + + 900 

Si GTGTAAACAACCACCGACTTGGCCTCTTCTAGATACGGTACGGACTGGGATGACAGGGGC 
HLLVAEPEKIYAMPDPTVPD 

ACAGTGACATCAAAGCCCTCACTACACTGTGTGACTTGGCCGACCGAGAGTTGGTGGTTA 

901 + + + + + + 960 

TGTCACTGTAGTTTCGGGAGTGATGTGACACACTGAACCGGCTGGCTCTCAACCACCAAT 
SDIKALTTLCDLADRELVVI 

TCATTGGATGGGCGAAGCATATTCCAGGCTTCTCCACGCTGTCCCTGGCGGACCAGATGA 

96! + + + + + + 1020 

AGTAACCTACCCGCTTCGTATAAGGTCCGAAGAGGTGCGACAGGGACCGCCTGGTCTACT 
1GWAKHIPGFSTLSLADQMS 

GCCTTCTGCAGAGTGCTTGGATGGAAATTTTGATCCTTGGTGTCGTATACCGGTCTCTTT 

1021 + + + + + + 1080 

CGGAAGACGTCTCACGAACCTACCTTTAAAACTAGGAACCACAGCATATGGCCAGAGAAA 
LLQSAWMEILILGVVYRSLS 
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C ATTTGAGGATGAACTTGTCTATGCAGACGATT ATATAAT GGACGAAGACCAG T CCAAAT 

1081 + + + + + + 1140 

GTAAACT C CTACTTGAACAGAT ACGTCTGCTAATATATTACCT GCT TCTGGT CAGGTTT A 
FEDELVYADDY IMDEDQSKL 

TAGCAGGCCTTCTTGATCTAAATAATGCTATCCTGCAGCTGGTAAAGAAATACAAGAGCA 

H41 . + + + + + + 1200 

ATCGTCCGGAAGAACTAGATTTATTACGATAGGACGTCGACCATTTCTTTATGTTCTCGT 
AGLLDLNNAI LQLVKKYKSM 

TGAAGCTGGAAAAAGAAGAATTTGTCACCCTCAAAGCTATAGCTCTTGCTAATTCAGACT 

12 01 + + + + + + 1260 

ACTTCGACCTTTTTCTTCTTAAACAGTGGGAGTTTCGATATCGAGAACGATTAAGTCTGA 
KLEKEEFVTLKAIALANSDS 

Li ccatgcacatagaagatgttgaagccgttcagaagcttcaggatgtcttacatgaagcgc 

1®61 + + + + + + 1320 

O GGTACGTGTATCTTCTACAACTTCGGCAAGTCTTCGAAGTCCTACAGAATGTACTTCGCG 



MHIEDVEAVQKLQDVLHEAL 



TGCAGGATTATGAAGCTGGCCAGCACATGGAAGACCCTCGTCGAGCTGGCAAGATGCTGA 

^§21 + + + + ~ + + , 1380 

ACGTCCTAATACTTCGACCGGTCGTGTACCTTCTGGGAGCAGCTCGACCGTTCTACGACT 

QDYEAGQHMEDPRRAGKMLM 



nj TGACACTGCCACTCCTGAGGCAGACCTCTACCAAGGCCGTGCAGCATTTCTACAACATCA 

X$S1 + + + + + + 1440 

CI ACTGTGACGGTGAGGACTCCGTCTGGAGATGGTTCCGGCACGTCGTAAAGATGTTGTAGT 

fU TLPLLRQTSTKAVQHFYNIK 

AACTAGAAGGCAAAGTCCCAATGCACAAACTTTTTTTGGAAATGTTGGAGGCCAAGGTCT 

1441 + + +— -+ + + 1500 

TTGATCTTCCGTTTCAGGGTTACGTGTTTGAAAAAAACCTTTACAACCTCCGGTTCCAGA 
LEGKVPMHKLFLEMLEAKV* 
(SEQ ID NO:4) 

GACTAAAAGCTCCCTGGGCCTTCCCATCCTTCATGTTGAAAAAGGGAAAATAAACCCAAG 

150 i + + + + + + 1560 

CTGATTTTCGAGGGACCCGGAAGGGTAGGAAGTACAACTTTTTCCCTTTTATTTGGGTTC 

AGTGATGTCGAAGAAACTTAGAGTTTAGTTAACAACATCAAAAATCAACAGACTGCACTG 

1561 + + + + + + 1620 

TCACTACAGCTTCTTTGAATCTCAAATCAATTGTTGTAGTTTTTAGTTGTCTGACGTGAC 

ATAATTTAGCAGCAAGACTATGAAGCAGCTTTCAGATTCCTCCATAGGTTCCTGATGAGT 

162 i + + + 4 + + 1680 

TATTAAATCGTCGTTCTGATACTTCGTCGAAAGTCTAAGGAGGTATCCAAGGACTACTCA 
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TCTTTCTACTTTCTCCATCATCTTCTTTCCTCTTTCTTCCCACATTTCTCTTTCTCTTTA 

1681 + + + 4 + + 1740 

AGAAAGATGAAAGAGGTAGTAGAAGAAAGGAGAAAGAAGGGTGTAAAGAGAAAGAGAAAT 

TTTTTTCTCCTTTTCTTCTTTCACCTCCCTTATTTCTTTGCTTCTTTCATTCCTAGTTCC 

1741 . + + + + + + 1800 

AAAAAAGAGGAAAAGAAGAAAGTGGAGGGAATAAAGAAACGAAGAAAGTAAGGATCAAGG 

CATTCTCCTTTATTTTCTTCCCGTCTGCCTGCCTTCTTTCTTTTCTTTACCTACTCTCAT 

1801 + + + — + + + I860 

GTAAGAGGAAATAAAAGAAGGGCAGACGGACGGAAGAAAGAAAAGAAATGGATGAGAGTA 

TCCTCTCTTTTCTCATCCTTCCCCTTTTTTCTAAATTTGAAATAGCTTTAGTTTAAAAAA 

2361 + + + + + + 1920 

U AGGAGAGAAAAGAGTAGGAAGGGGAAAAAAGATTTAAACTTTATCGAAATCAAATTTTTT 

O 

O AAAAATCCTCCCTTCCCCCTTTCCTTTCCCTTTCTTTCCTTTTTCCCTTTCCTTTTCCCT 

jjfl! + + + 4 + t + 1980 

j~ TTTTTAGGAGGGAAGGGGGAAAGGAAAGGGAAAGAAAGGAAAAAGGGAAAGGAAAAGGGA 

-P TTCCTTTCCTTTCCTCTTGACCTTCTTTCCATCTTTCTTTTTCTTCCTTCTGCTGCTGAA . 

illl + + + + + + 2040 

% AAGGAAAGGAAAGGAGAACTGGAAGAAAGGTAGAAAGAAAAAGAAGGAAGACGACGACTT 



n I CTTTTAAAAGAGGTCTCTAACTGAAGAGAGATGGAAGCCAGCCCTGCCAAAGGATGGAGA 

2gi + + + + + + 2100 

6 GAAAATTTTCTCCAGAGATTGACTTCTCTCTACCTTCGGTCGGGACGGTTTCCTACCTCT 

TCCATAATATGGATGCCAGTGAACTTATTGTGAACCATACCGTCCCCAATGACTAAGGAA 

2ioi + + + + + + 2160 

AGGTATTATACCTACGGTCACTTGAATAACACTTGGTATGGCAGGGGTTACTGATTCCTT 

TCAAAGAGAGAGAACCAACGTTCCTAAAAGTACAGTGCAACATATACAAATTGACTGAGT 

2161 + + + + + + 2220 

AGTTTCTCTCTCTTGGTTGCAAGGATTTTCATGTCACGTTGTATATGTTTAACTGACTCA 

GCAGTATTAGATTTCATGGGAGCAGCCTCTAATTAGACAACTTAAGCAACGTTGCATCGG 

2221 + + + + 4 — + 2280 

CGTCATAATCTAAAGTACCCTCGTCGGAGATTAATCTGTTGAATTCGTTGCAACGTAGCC 

CTGCTTCTTATCATTGCTTTTCCATCTAGATCAGTTACAGCCATTTGATTCCTTAATTGT 

2281 + + + + + + 2340 

GACGAAGAATAGTAACGAAAAGGTAGATCTAGTCAATGTCGGTAAACTAAGGAATTAACA 
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TTTTTCAAGTCTTCCAGGTATTTGTTAGTTTAGCTACTATGTAACTTTTTCAGGGAATAG 

_. + + + + + + 

AAAAAGTTCAGAAGGTCCATAAACAATCAAATCGATGATACATTGAAAAAGTCCCTTATC 



TTTAAGCTTTATTCATTCATGCAATACTAAAGAGAAATAAGAATACTGCAATTTTGTGCT 

2401 + + + + + — + 2460 

AAATTCGAAATAAGTAAGTACGTTATGATTTCTCTTTATTCTTATGACGTTAAAACACGA 

GGCTTTGAACAATTACGAACAATAATGAAGGACAAATGAATCCTGAAGGAAGATTTTTAA 

2461 + + + + + + 2520 

CCGAAACTTGTTAATGCTTGTTATTACTTCCTGTTTACTTAGGACTTCCTTCTAAAAATT 

AAATGTTTTGTTTCTTCTTACAAATGGAGATTTTTTTGTACCAGCTTTACCACTTTTCAG 

2521 • 4 + + + + + 2580 

TTTACAAAACAAAGAAGAATGTTTACCTCTAAAAAAACATGGTCGAAATGGTGAAAAGTC 



CCATTTATTAATATGGGAATTTAACTTACTCAAGCAATAGTTGAAGGGAAGGTGCATATT 

+ + + + + + 

GGTAAATAATTATACCCTTAAATTGAATGAGTTCGTTATCAACTTCCCTTCCACGTATAA 

ATCACGGATGCAATTTATGTTGTGTGCCAGTCTGGTCCCAAACATCAATTTCTTAACATG 

+ + + + + ' — + 

TAGTGCCTACGTTAAATACAACACACGGTCAGACCAGGGTTTGTAGTTAAAGAATTGTAC 



L AGCTCCAGTTTACCTAAATGTTCACTGACACAAAGGATGAGATTACACCTACAGTGACTC 

+ + + + + + 2760 

m TCGAGGTCAAATGGATTTACAAGTGACTGTGTTTCCTACTCTAATGTGGATGTCACTGAG 

5 TGAGTAGTCACATATATAAGCACTGCACATGAGATATAGATCCGTAGAATTGTCAGGAGT 

2S61 ————-—+—-——+—-— — --—+—----— — h — r — + 2820 

ACTCATCAGTGTATATATTCGTGACGTGTACTCTATATCTAGGCATCTTAACAGTCCTCA 

GCACCTCTCTACTTGGGAGGTACAATTGCCATATGATTTCTAGCTGCCATGGTGGTTAGG 

2821 + + + + + + 2880 

CGTGGAGAGATGAACCCTCCATGTTAACGGTATACTAAAGATCGACGGTACCACCAATCC 



AATGTGATACTGCCTGTTTGCAAAGTCACAGACCTTGCCTCAGAAGGAGCTGTGAGCCAG 

+ + + + + + 

TTACACTATGACGGACAAACGTTTCAGTGTCTGGAACGGAGTCTTCCTCGACACTCGGTC 



TATTCATTTAAGAGAATTCCACCACACTGGCGGCCCGCGCTTGAT {SEQ ID NO: 3) 

2941 + + + + 2985 

ATAAGTAAATTCTCTTAAGGTGGTGTGACCGCCGGGCGCGAACTA (SEQ ID NO: 30) 
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1 MDSVELCLPE SFSLHYEEEL LCRMSNKDRH IDSSCSSFIK TEPSSPASLT 

51 DSVNHHSPGG SSDASGSYSS TMNGHQNGLD SPPLYPSAPI LGGSGPVRKL 

101 YDDCSSTIVE DPQTKCEYML NSMPKR LCLV CGDIASGYHY GV&SCEACKA 

151 FFKRTIQGNI gYSCPATKEC EITKRRRKSC QACRFMKCLK VGM LKEGVRL 

201 DRVRGGRQKY KRRIDAENSP YLNPQLVQPA KKPYNKIVSH LLVAEPEKIY 

251 AMPDPTVPDS DIKALTTLCD LADRELWI I GWAKHIPGFS TLSLADQMSL 

301 LQSAWMEILI LGWYRSLSF EDELVYADDY IMDEDQSKLA GLLDLNNAIL 

351 QLVKKYKSMK LEKEEFVTLK AIALANSDSM HIEDVEAVQK LQDVLHEALQ 

401 DYEAGQHMED PRRAGKMLMT LPLLRQTSTK AVQHFYNIKL EGKVPMHKLF 

451 LEMLEAKV* (SEQ ID NO:4) 
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1 


GCGGGCCGCC AGTGTGGTGG AATTCGGCTT 


GTCACTAGGA 


GAACATTTGT 


51 


GTTAATTGCA 


CTGTGCTCTG TCAAGGAAAC 


TTTGATTTAT 


AGCTGGGGTG 


101 


CACAAATAAT 


GGTTGCCGGT CGCACATGGA 


TTCGGTAGAA 


CTTTGCCTTC 


151 


CTGAATCTTT 


TTCCCTGCAC TACGAGGAAG 


AGCTTCTCTG 


CAGAATGTCA 


201 


AACAAAGATC 


GACACATTGA TTCCAGCTGT 


TCGTCCTTCA TCAAGACGGA 


251 


ACCTTCCAGC 


CCAGCCTCCC TGACGGACAG 


CGTCAACCAC 


CACAGCCCTG 


301 


GTGGCTCTTC 


AGACGCCAGT GGGAGCTACA 


GTTCAACCAT 


GAATGGCCAT 


351 


CAGAACGGAC 


TTGACTCGCC ACCTCTCTAC 


CCTTCTGCTC 


CTATCCTGGG 


401 


AGGTAGTGGG 


CCTGTCAGGA AACTGTATGA TGACTGCTCC AGCACCATTG 


451 


TTGAAGATCC 


CCAGACCAAG TGTGAATACA 


TGCTCAACTC 


GATGCCCAAG 


501 


AGACTGTGTT 


TAGTGTGTGG TGACATCGCT 


TCTGGGTACC 


ACTATGGGGT 


551 


AGCATCATGT 


GAAGCCTGCA AGGCATTCTT 


CAAGAGGACA ATTCAAGGCA 


601 


ATATAGAATA 


CAGCTGCCCT GCCACGAATG 


AATGTGAAAT 


CACAAAGCGC 


651 


AGACGTAAAT 


CCTGCCAGGC TTGCCGCTTC 


ATGAAGTGTT 


TAAAAGTGGG 


701 


CATGCTGAAA 


GAAGGGGTGC GTCTTGACAG 


AGTACGTGGA 


GGTCGGCAGA 


751 


AGTACAAGCG 


CAGGATAGAT GCGGAGAACA 


GCCCATACCT 


GAACCCTCAG 


801 


CTGGTTCAGC 


CAGCCAAAAA GCCATATAAC 


AAGATTGTCT 


CACATTTGTT 


851 


GGTGGCTGAA 


CCGGAGAAGA TCTATGCCAT 


GCCTGACCCT 


ACTGTCCCCG 


901 


ACAGTGACAT 


CAAAGCCCTC ACTACACTGT 


GTGACTTGGC 


CGACCGAGAG 


951 


TTGGTGGTTA TCATTGGATG GGCGAAGCAT 


ATTCCAGGCT 


TCTCCACGCT 


1001 


GTCCCTGGCG GACCAGATGA GCCTTCTGCA GAGTGCTTGG ATGGAAATTT 
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1051 


TGATCCTTGG 


TGTCGTATAC 


CGGTCTCTTT 


CATTTGAGGA 


TGAACTTGTC 




1101 


TATGCAGACG 


ATTATATAAT 


GGACGAAGAC 


CAGTCCAAAT 


TAGCAGGCCT 




1151 


TCTTGATCTA AATAATGCTA 


TCCTGCAGCT 


GGTAAAGAAA TACAAGAGCA 




1201 


TGAAGCTGGA AAAAGAAGAA 


TTTGTCACCC 


TCAAAGCTAT 


AGCTCTTGCT 




1251 


AATTCAGACT 


CCATGCACAT 


AGAAGATGTT 


GAAGCCGTTC AGAAGCTTCA 




1301 


GGATGTCTTA 


CATGAAGCGC 


TGCAGGATTA 


TGAAGCTGGC 


CAGCACATGG 




1351 


AGAAGACCCT 


CGTCGAGCTG 


GCAAGATGCT 


GATGACACTG 


CCACTCCTGA 




1401 


GGCAGACCTC 


TACCAAGGCC 


GTGCAGCATT 


TCTACAACAT 


CAAACTAGAA 


SSM. 

\ ti 


1451 


GGCAAAGTCC 


CAATGCACAA ACTTTTTTTG 


GAAATGTTGG 


AGGCCAAGGT 


gas 

m 


1501 


CTGACTAAAA 


GCTCCCTGGG 


CCTTCCCATC 


CTTCATGTTG 


AAAAAGGGAA 




1551 


AATAAACCCA AGAGTGATGT 


CGAAGAAACT 


TAGAGTTTAG 


TTAACAACAT 




1601 


CAAAAATCAA 


CAGACTGCAC 


TGATAATTTA 


GCAGCAAGAC 


TATGAAGCAG 


5 ;£ 

r-~ 


1651 


CTTTCAGATT 


CCTCCATAGG 


TTCCTGATGA 


GTTCTTTCTA 


CTTTCTCCAT 


! = g 


1701 


CATCTTCTTT 


CCTCTTTCTT 


CCCACATTTC 


TCTTTCTCTT 


TATTTTTTCT 


s vr 


1751 


CCTTTTCTTC 


TTTCACCTCC 


CTTATTTCTT 


TGCTTCTTTC 


ATTCCTAGTT 




1801 


CCCATTCTCC 


TTTATTTTCT 


TCCCGTCTGC 


CTGCCTTCTT 


TCTTTTCTTT 




1851 


ACCTACTCTC 


ATTCCTCTCT 


TTTCTCATCC 


TTCCCCTTTT 


TTCTAAATTT 




1901 


GAAATAGCTT 


TAGTTTAAAA AAAAAAATCC 


TCCCTTCCCC 


CTTTCCTTTC 




1951 


CCTTTCTTTC 


CTTTTTCCCT 


TTCCTTTTCC 


CTTTCCTTTC 


CTTTCCTCTT 




2001 


GACCTTCTTT 


CCATCTTTCT 


TTTTCTTCCT 


TCTGCTGCTG 


AACTTTTAAA 




2051 


AGAGGTCTCT AACTGAAGAG AGATGGAAGC 


CAGCCCTGCC AAAGGATGGA 
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2101 


GATCCATAAT 


ATGGATGCCA 


GTGAACTTAT 


TGTGAACCAT 


ACCGTCCCCA 




2151 


ATGACTAAGG 


AATCAAAGAG 


AGAGAACCAA 


CGTTCCTAAA AGTACAGTGC 




2201 


AACATATACA AATTGACTGA 


GTGCAGTATT 


AGATTTCATG 


GGAGCAGCCT 




2251 


CTAATTAGAC 


AACTTAAGCA 


ACGTTGCATC 


GGCTGCTTCT 


TATCATTGCT 




2301 


TTTCCATCTA GATCAGTTAC AGCCATTTGA TTCCTTAATT 


GTTTTTTCAA 




2351 


GTCTTCCAGG 


TATTTGTTAG 


TTTAGCTACT 


ATGTAACTTT 


TTCAGGGAAT 




2401 


AGTTTAAGCT 


TTATTCATTC 


ATGCAATACT 


AAAGAGAAAT 


AAGAATACTG 


b 


2451 


CAATTTTGTG CTGGCTTTGA ACAATTACGA ACAATAATGA AGGACAAATG 


~. Z"': 
3 : ;: 
Stf k 


_ 2501 


AATCCTGAAG 


GAAGATTTTT 


AAAAATGTTT 


TGTTTCTTCT 


TACAAATGGA 


sp 

03 


2551 


GATTTTTTTG 


TACCAGCTTT 


ACCACTTTTC 


AGCCATTTAT 


TAATATGGGA 


2601 


ATTTAACTTA 


CTCAAGCAAT 


AGTTGAAGGG 


AAGGTGCATA 


TTATCACGGA 


S 


2651 


TGCAATTTAT 


GTTGTGTGCC 


AGTCTGGTCC 


CAAACATCAA 


TTTCTTAACA 


?!« 

w 


2701 


TGAGCTCCAG 


TTTACCTAAA 


TGTTCACTGA 


CACAAAGGAT 


GAGATTACAC 


Hi 

s.w 


2751 


CTACAGTGAC 


TCTGAGTAGT 


CACATATATA 


AGCACTGCAC 


ATGAGATATA 




2801 


GATCCGTAGA 


ATTGTCAGGA 


GTGCACCTCT 


CTACTTGGGA 


GGTACAATTG 




2851 


CCATATGATT 


TCTAGCTGCC 


ATGGTGGTTA 


GGAATGTGAT 


ACTGCCTGTT 




2901 


TGCAAAGTCA 


CAGACCTTGC 


CTCAGAAGGA 


GCTGTGAGCC 


AGTATTCATT 




2951 


TAAGAGAATT 


CCACCACACT 


GGCGGCCCGC 


GCTTGAT (SEQ ID NO: 5) 
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1 MDSVELCLPE SFSLHYEEEL LCRMSNKDRH IDSSCSSFIK TEPSSPASLT 

51 DSVNHHSPGG SSDASGSYSS TMNGHQNGLD SPPLYPSAPI LGGSGPVRKL 

101 YDDCSSTIVE DPQTKCEYML NSMPKRLCLV CGDIASGYHY GVASCEACKA 

151 FFKRTIQGNI EYSCPATWEC EITKRRRKSC QACKFMKCLK VGM LKEGVRL 

201 DRVRGGRQKY KRRIDAENSP YLNPQLVQPA KKPYNKIVSH LLVAEPEKIY 

251 AMPDPTVPDS DIKALTTLCD LADRELWII GWAKHI PGFS TLSLADQMSL 

301 LQSAWMEILI LGWYRSLSF EDELVYADDY IMDEDQSKLA GLLDLNNAIL 

351 QLVKKYKSMK LEKEEFVTLK AIALANSDSM HIEDVEAVQK LQDVLHEALQ 

401 DYEAGQHMEK TLVELARC* (SEQ ID NO: 6) 
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